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(54) Title: VOICE ACTIVATED DIALING WITH REDUCED STORAGE REQUIREMENTS 
(57) Abstract 



A voice activated dialing system (30) includes a database 
(36) maintaining a plurality of speech templates, associated 
telephone numbers, and recordings, wherein multiple speech 
templates can be linked to a single master recording. Process- 
ing circuitry (34) compares an utterance from a user to one 
or more of the templates from the database to find a match- 
ing template. When a match is found, the processing circuitry 
retrieves the telephone number associated with the matching 
template and the corresponding recording. The recording is 
replayed to the user prior to initiating a connection to the as- 
sociated telephone number. Master recordings can be used for 
multiple entries which are associated with a single telephone 
number to reduce storage requirements. 
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VOICE ACTIVATED DIALING WITH 
REDUCED STORAGE REQUIREMENTS 



BACKGROUND OF THE INVENTION 

1. TECHNICAL FIELD 

5 This invention relates in general to telecommunications and, more 

particularly, to voice activated dialing. 

2. DESCRIPTION OF THE RELATED ART 

Voice activated dialing has become a popular feature in many telephone 
systems. With voice activated dialing, a user can call a destination number without 
10 the need to look up the number or press any keys. 

■ ~ - Typically, a caller can"initiate"aT:onnection- through^^imple command such ~as 

"call the boss" or "call hoih^;" In 3 thh 1 exaiiiLplg7 /> cafi il/ is a command recognized by 
' the voice activated dialmg"sy^t6m (typidafiy thfbiigh 'Speaker independent vbice ' 
recognition) and "the boss" or "home" are destinations enrolled by the user. Discing 

15 enrollment, the user speaks the phrase which will be used to designate a destination 
and provides a number, eith^erltbrough the telephone keypad or by speaking the 
numbers into the phone (which are recognized using speaker 1 independent voice 
recognition). The phrase is digitally recorded by the vdice activated dialing sy stein 
and is also reduced to a template (typically using linear predictive coding) for speech 

20 recognition purposes. \ . 

After enrollment, if the user initiates a call, for example, "call the boss", the 



system will compare the utterance with all the voice dialing templates associated 
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with that user to determine whether there 1 is a Snatch. If not/ the system will ask the 
user to repeat the request. If a match is found>;the system .will respond with "calling 
the boss", where "the boss" is repeated in the users own voice using the recording 
stored along with the template, during enrollment. This confirmation gives the user 
an opportunity to abandon the call if the voice activated system does not properly 
recognize the command. - *. 

While the confirmation is an important feature of the voice activated dialing 
system, since the matching process is not perfect and incorrect matches can occur, its 
cost in storage space and time is significant. While the template is fairly small, 
typically .1000 bytes or less, ajcompr^s^recprdi larger 
amount of storage in order tQ reiaiiV3n©Mg^s;peech data to re-create an acceptable 
recording for p&ybad^ ^ §tprage,r£quired fojlarge - 

voice activated teriiplaferdii^to^ for-the 
corresponding recordingis. Frequently, because of their size,, t4iecrecprdings:are not 
stored on the system doing the recognition, but must be downloaded from another 
system. This introduces call setup delays which are annoying to users. 

Naturally, this is a significant problem when large directories are used. The 
problem increases when longer utterances are recorded to accompany the templates. 
For a 1.5 second recording at 16,000 bits/sec, 24,000 bits or 3000 bytes are needed. 
20 For a higher quality recording at 64,000 bits/ sec, 12000 bytes are necessary for the 
same recording. 

While voice activated dialing should be encouraged to promote efficiency in 
the workplace, particularly when work includes calling a large number of numbers, 
the costs of storage can mean that workers must reduce their use of voice activated 
25 dialing. Typically, workers will eliminate the least used numbers from their voice 
activated dialing lists; it is these numbers that are the least likely to be memorized, 
requiring the workers to look up the numbers, which is inefficient. 

Therefore a need has arisen for a voice activated dialing system for large 
dialing directories with reduced storage requirements. 

2 
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BRIEF SUMMARY OF THE INVENTION , r: r 

- A voice activated dialing System includes a database maintaining a plurality 

of speech templates, associated telephone numbers; and recordings, wherein 
multiple speech templates can be linked to a single recording. - Processing circuitry 
5 compares an utterance from a user to one or more of the templates from the database 
to find a matching template. When a match is found, the processing circuitry 
retrieves the telephone number associated with the matching template and the 
recording linked to the matching template. The recording is replayed to the user 
prior to initiating a connection to the associated telephone number. 

10 The present Mvimtion p^^ 

First- the storage requirements, and hence theorist; of the systenvcan b^ greatly 
reduced -because the 'number 6£ recordings ulsed ~#vthe s^stenrcaribe greatly 
reduced. Second/ b^attee it is possible to store 

the ^cordiilgs locai to tfa&prbc&sm the responsiveness 

15 -of the systems o^wh . n a -<i:ri ^,n:t^s:^ :cli ri ' . -' c 

.-'^ r. - "* r.ioiH'-v rrj (eh 'iisitt 1 1 v. c ~ • . • ; * . > 
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BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS 

For a more complete understanding of the present invention, and the 
advantages thereof, reference is now made to the following descriptions taken in 
Conjunction with the accompanying drawings, in which: 

Figure 1 illustrates a prior art structure for voice activated dialing; 

Figure 2 illustrates'a structure for voice activated dialing to reduce storage 
requirements and increase speed; and - ? "; f - " - ■ 

. . t Figyre 3 illustrates a flow chart which could be used for enrollment using the 
.voice activated dialing ^ stmcture pf Figure 2. . 



'it r' 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention is best understood in relation to Figures 1-3 of the 
drawings, like numerals being used for like elements of the various drawings. 

Figure 1 illustrates a depiction of a prior art voice activated. dialing system 10. 
5 It should be noted that this structure has been greatly simplified to show the 

important features for illustration purposes. The voice activated dialing system 10 is 
coupled to a telecommunications system 12 (which may be a local telephone system, 
a long distance carrier, private branch exchange (PBX), computer-network or other 
system). The voice activated dialing system 10 includes processing circuitry 14 and 
10 database circuitry l6 (which may be a central database shaired *tetw£en 'multiple 

processing circuits 14). Processing circuitry 14 includes database interfece circuitry 
18 to communicate with the database circuitry 16. 

The database circuitry 16 may be a centralized database or a distributed 
database. In a centralized database system, multiple processors 14 access a common 
15 database with information for all users in the common database. In a distributed 

database system, each processor has an associated database which contains a portion 
of the information of the overall database; if information is needed for a user which is 
not stored locally, it must be downloaded from another distributed database, or from 
a centralized backup database. 

20 In operation, the voice activated dialing system 10 receives commands from 

users for dialing instructions. The commands, given verbally by users, are received 
by the processing circuitry 14, where the utterance is conditioned (typically into 
linear predictive coding vectors) and compared to the user's templates in database 
circuitry 16. It is assumed that the speaker's identity (for purposes of downloading 

25 the correct templates from the database circuitry 16) is known to the processing 
circuitry 14 either by the identity of the phone extension, a user logon, or other 
means. The templates are then compared with the linear predictive coding vectors 
derived from the utterance, and the closest match is found. The database circuitry 16 
stores a corresponding telephone number and recording for each template, which are 

30 retrieved from database 16 to initiate the connection. 

5 



BNSDOCID <WO„ 0039982A2. \ > 



WO 00/39982 PCT/US99/31089 
As an illustrative example, assume MtUser2 initiates a voice activated 
dialing session by "speaking "call store 102" into nis phone. The processing circuitry 
14 digitizes User2's speech and transforms it into LPC (linear predictive coding) 
vectors. These vectors are compared with all the templates for User2 in the database 
5 16. If a match is found, the recording associated with the matching template is 
downloaded along with the telephone number associated with the matching 
- templates In this example, the matching template should be,<template 1002>. Thus, 
recordings "Store 102"-user2> should be downloaded along with the number "214- 
555-1112"- Theprocessing circuitry will announce "calling" and replay the recording 
lO t .< "Store 102"-user2>, which is. the recording made from User2's voice at the time that 
User2 enrolled "Store 102" into the v4c§.ac^va^.d^alirig : sysl^.l0. Unless User2 
. aborts the-call, the ; prqcessingck 
- telecommunications §ysltem42 to.; initiate if 1 ? : c PB?5 ? ^? n *?; S ^a *°?7r .• :., 

As shown in Figure % the one-tonSrie corf espbnderice between templates ' and 
15 recordings causes the systemlu'to Have signfficart iricreases in storage requirements 
each time a new voice activated dialing entry is a'dded,' since each new entry adds a 
new digital recording: ' Further, there is a significant delay in initiating the ■ 
connection the recording has to'be downloaded from another systems - • 



:i 



Figure 2 illustrates, a^yoice activated dialing system 30 whiph can dramatically . 

20 decrease the amount of storagef for^vpice activated dialing and which also incr.ea.ses 
the responsiveness of the system. As in Figure 1, the voice activated dialing, system 
30 is coupled to a telecommunications system 12 (which may be a local telephone 
system, a long distance carrier; private branch exchange (PBX), computer network or 
other system). The voice activated dialing system 30 includes processing circuitry 34 

25 and database circuitry 36 (whicn may be shared between multiple V AD systems). 

Processing circuitry 34 includes database interface circuitry 38 to' communicate with 
the database circuitry 36. 

The voice activated dialing system 30 uses a many-to-one relationship 
between me templates and me recordings.' Accordingly, when multiple users each 
30 have voice activated dialingentries for "the same telephone number, the templates for 

6 
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each user remain unigue (as is necessary for proper speaker dependent speech 
recognition), but a single master recording for the telephone number is replayed 
during the confirmation stage.- Accordingly, the number of digital recordings stored 
in the voice activated dialing system 30 is determined by the number of unique 
5 telephone numbers, rather, than by the number of enrolled voice activated dialing 
entries. , k 

Using the example set forth above, User2 once again initiates a voice activated 
dialing session by speaking "call store 102" into his phone. The processing tircuiitry 
34 digitizes User2's speech and transforms it ihto LPG (lirrfeat 1 predictive coding): 
10 vectors . These vectors are compare*! With kll the templates for User2«in the database 
36. If a match is found/ tHe recording associated with the matchirig tfeitiplate is 
retrieved alongHvith the tefephoife riutnbeFa<^ 

In this example, £he'^ J fc1^^^mpl^ sfto^ r bS ^lerftf>lSt£ I002>, as in the example 
above. In tfcis c^se, however, the recording <"Store 102"-master> is retrieved along 

15 . \ with the number " 2X^^5-^X1^!' t: ^ln many cases,, the recordings for the overall 
database pan be store^pc^Uy^.due to the smaller ^storage requirements for the 
recordings., The processing circuit^^will announce "calling" and replay the 
recording <'Store 102"-master>. Unless User2 aborts the call, the processing 
circuitry 34 will provide the dialing information to the telecommunications system 32 

20 to 'initiate the' connection to St6f e T02. -M caSes- Where the ^econaSEngsrorat least the 
master recordings, are local to the processor 34/ the confirming recording can be 
played alrnost immediately &f i er the recc%riition. "~ Y - ' * : o 

: :[ * " r /i v/ '^.' ;t r • ■ *■ ^ 

If User 1 initiates a voice activated dialing session by speaking "call store 102" 

into her phone, the voice activated dialing system 30 digitizes Userl's speech and 

25 transforms the utterance into LPC vectors, to compare to Userl's templates. After 

matching the utterance with template <Template2>, the system will once again 

retrieve the <"Store 102"-master> recording along with the telephone number "214- 

555-1212". 

On the other hand, if User2 initiated a voice activated dialing session by 
30 stating "call home", the confirmation process would retrieve a unique recording 
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<"home-user2> for repeating to User2 prior to making the connection. If Userl 
initiated a voice activated dialing session by stating "call home", the confirmation 
process would retrieve a unique recording <"home-userl>. This is because the 
telephone numbers associated with the two voice activated ^dialing entries are 
5 different, even though the verbal command to initiate .the voice activated dialing 
sessions is. the same. . - . .•-■* ~ .-.::,-;■=- : „ 

Accordingly, the voice activated dialing system 30 described above can gireatly 
reduce storage requirements in certain situations where multiple callers each have 
entries for the same destination number. A situation like this would occur, for v 
10 . example, in a chain of stores where calls are frequently placed between the stores. It 

■ ' : *-!•".- " .,;r -:-p.\ r) j 2 ~; - L ,7i * - ,J -\ , - v 

would also be useful in an office environment where each employee has voice 

" ■ " r ^dr rr i j ~ \rr- i_ r:o to • -v.; . . • • 

activated dialing entries for a large number of the other employees, such that 

extension numbers dp not heed to be memorized. 

Further, if the entire database can be stored locally, or if the recordings can be 
15 stored locally, the responsiveness of the system can be greatly enhanced. In some 
systems where there is not enough local storage to store all recordings locally, the 
master recordings may be stored locally for faster response in connection with the 
most frequently used recordings. 

It should J be^ribtfectth^t the master recordings are associated with the- ( 
20 telephone humbets; not the Cdntmand represented by the^template; For example, 
Userl could refer to'-'Store 102" as "the Oklahoma Store" in her voice activated . 
dialing. To initiate a ^oice activated dialingtfo call "Store„102", Userl would say "call 
the Oklahoma 'store" and the voice activated-dialing system 10, after correctly 
matching the utterance to template ;<Template2>, would respond with "Calling Store 
25 102", where "Store 102" was recorded in <"St6re 102-master">. Thus, the command 
' * "used by an individual can vary from the recording used in the confirmation process. 

The making of a master recording, i.e., a recording associated with two or 
more voice activated dialing entries, can be done in a number of ways. One way 
would be to notify the system administrator when redundant entries (multiple 

8 
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entries to the same telephone number) were found in the database 36. The system 
administrator could then make a master recording to which each of the redundant 
entries would be linked through the database 36. 

A second method r of enrolling new entries is shown in Figure 3: In block 50, 
5 processing circuitry 34 identifies the user arid prompts the user for a new voice 
activated dialing entry. In block 52, the entry spoken by the user is received and 
recorded and the telephone number associated with the entry is entered by the user, 
either by voice commands or the touch tone keypad. In decision block 54, the 
processing circuitry 34 searches the database 36 for other references to the telephone 
10 ; number entered by the user. If no redundant entries are detected, the recording is 

saved in die database 36 and linked to the entry in block 56. On the other Rand, if the 
processing circuitry 34 detects on or more references to the telephone number 

entered by the user, it links the entry to the previously enrolled entry in block 60. At 

.kosh^n-srr: c-i ton o" " • 

a later time, the recording could then be changed, if desired, by authorized 

15 : personnel. u \v ... r -yzi^-.. {••• ~ 

The storag;e sayings can be' significant.' In a system where 1000 users 
frequently^ call the same 2000 destinations, there would be up to 2000 templates for 
each of 1000 users, plus up to 2000 recordings for each of 1000 users/ The template 
size, assuming use of a 1000 byte template, would be 1000 users X 2000 

20 templates/ user X 1000 bytes/ template, or 2. gigabytes Qf4:emplat§ information. Even 
with speech compression* the'sizeiof tihje confimung riecojdings .would be r 
approximately 1000 usersX2000 recordings/ uset X 16>000 bit/ sec X 1.5 
sec / recording X 1 byte/8 bits; or 6 gigabytes; ? If the recording was performed at; 64k 
bits per second, the require recording storage would be 24 gigabytes. By reducing 

25 the 2 million recordings to 2000^'ecordings by using shared^naster recordings, the 
r storage requirements could be reduced byla factor of 1000 to 6 megabytes (24 

megabytes for 64 kbit/sec recordings). This, relatively, small -amount of storage can 
easily be accommodated locally. The templates, which are significantly smaller, can 
be stored locally if possible or centrally if necessary ^ L 
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The present invention provides several advantages over the prior art. First, in 
many situations, the number of recordings staffed on the system can be greatly 
reduced, thus reducing storage requirement and, consequently, the cost of the voice 
activated dialing. Sfecond, since the recording storage is greatly reduced, the 
5 recordings can be maintained locally, increasing the speed of retrieval and the 
responsiveness of the system. ' * ■■ 

Although the Detailed Description of the invention has been directed to 
certain exemplary . embodiments, various modifications of these embodiments, as 
well as alternative embodiments, will be suggested to those skilled in the art, Jhe 
10 inventipn: encompasses any modifications or alternative embodiments that fall within 
the sc.opeof the Claims.- ; . x j: : :*r ^r^ru! o .: - 
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CLAIMS 

.1. A, voice activated dialing system, comprising: 
a database including a plurality of speech templates, associated telephone 
numbers, and recordings, wherein multiple speech templates can be linked to a 
5 single recording; and ....... 

processing circuitry for: 

comparing an utterance to one or more of said templates from said 
database to determine a matching template; - > ^ - 

retrieving the telephone number associated with the matching - - 
10 template; ' ' *" f ; - ^ , : . : v ,\. ^ ■ ':/<■ 

retrieving the recording lihked fb the matching template; and; i 
replaying said recording to the user prior to initiating a connection to 
said associated telephone number. 

2. The voice activated dialing system of claim 1 wherein said recordings 
15 are stored local to the processing circuitry. 

3. The voice activated dialing system of claim 1 wherein said templates 
are stored local to the processing circuitry. 

4. The voice activated dialing system of claim 1 wherein said templates 
are stored in a centralized database. 

20 5. The voice activated dialing system of claim 1 wherein said processing 

circuitry identifies multiple associations of multiple templates to a common 
telephone number. 

6. A method of performing voice activated dialing, comprising the steps 

of: 

25 storing a plurality of speech templates, associated telephone numbers, and 

associated recordings, wherein multiple speech templates can be linked to a single 
recording; and 

comparing an utterance to one or more of said templates from said database in 
a processor using speech recognition techniques to determine a matching template; 

11 
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retrieving the telephone number associated with the matching template; 
retrieving the recording linked to the matching template; and 
replaying said recording to the user prior to initiating a connection to said 
associated telephone number. _ . . _ 



5 7. The method of claim 6 wherein said storing step includes the ? step of 

storing recordings local to the processing circuitry. , , 



I 8. : The method of claim 6 wherein said storing step includes the jsiep of 
> * stbring tem^pjates loc^ldthej>rocessin^circuitay. ~ 

- f . !i i ; *. - -3 . • ■* % i 

i - • "s r i: *' • - ■ ~* ' ^ " ' • • 

1 I „ 

, - - TRelfiethod pf claim 6 wherein said sToring step includes the §tep of 

10" storisig t^mi^^^ ' " ■ - *. " 



~ 10. T3*e^ejfrc^^^ 
multiple associations, of multiple teriaplatesio a common tel^phcm 

11. - -The method bf d^te l fr^ the 

step of receiving a- telephone number^rom aniserxluring enrollment oi a new voice 

I ~ — V > 5 \f£: »cc.;-0: r ! < V 0 . • = .?3 : " 

15 » aetivated^iaiing^ntry and eomparing-*he- telephone number-to other of said: 

-telephone- numbers.-^ | - — ~ — - 



12 



BNSDOCID: <WO 0039982A2„L> 



WO 00/39982 



PCT/US99/31089 



1/3 





16 : 
\ 


■ FIG. 1 

(PRIOR ART) 


10 

/ 


USER 


TEMPLATE 


NUMBER 




RECORDING 


Userl r '- 


<Templbtei>~ 


214-555-1111 




<"Store 1 0 1* -userl > 


Userl 


<Template2> 


214-555-1112 




<"Stbre 102"-user1> 


Userl 


<Templote3> 


.214-555-1.113. 




<"Store 103"-user1> 




Userl 


<Template4> 


214-555-1114 




<"Store .104-:-user.l> 


Userl 


<Template5> 


214-555-5555 




<"Home ,, -user1> 


• • 








i'-r. \,.\. 


User2 


<Templote1001> 


214-555-1111' ' 




' < M Stbre lbl"-user2> 


.U?er2-, D . 


<Ternplate-1 Q,02> -> - t 






=;r,<-'Stpre 102"-user2> 


User2. 


, <Templote1003> 


r 2 1 4;— 555— *1 1j1 3 ' f. 


— : — 


<"Store 103 ,f -user2> 


User2 


<Templote1004> 


214-555-1114 




<"Store 104"-user2> 


User2 " 


<Terhplote'l005> 


214-555-6666 




<"Home*'-user2> 


User2 


<Templote1006> 


770-555-1212 




<"Mom ,, -user2> 


• * • 










User3 


<Templote2001 > 


214-555-1111 


> 


<"Store 101"-user3> 


User3 


<Templote2002> 


214-555-1112 




<"Store 102"-user3> 


User3 


<Template2003> 


214-555-1113 




<"Store 103"-user3> 


User3 

i 1 


<Templote2004> 


214-555-1114 


► 


<"Store 104"-user3> 



DATABASE l/F 



l SPEECH RECOGNITION 
' PROCESSING 



[other speech"! i . A 

I RECOGNITION i K~ 
' PROCESSING I j J 
' SYSTEMS Jj 

7 ' 

14 



BNSDOCID: <WO 0039982A2. 1_> 




WO 00/39982 



PCT/US99/31089 



2/3 



36 



FIG. 2 



USER 


TEMPLATE 


NUMBER 


Userl 


<Templote1> 


214-555-1111 


Userl 


<Templote2> 


214-555-1112' 


Userl 


<Template3> 


214-555-11.13. . 


Userl 


<Template4> 


214-535-VU4jvi 


Userl 


<Template5> 


214-555-5555 


• • • 






User2 


<Templote1001> 


2 14-535- Vm*.}- 


User2 


<Templote1002> 


214-555-1112 


User2 


<TemplQLe.1.QD3>. . 


214.-555^13' 


User2 


<Template1004> 


214-555-1114 


User2 

r^r*ri 


<Tempfate1005> 


214i-55 l 5-5666- 


-User2 f 


<TempfoteJQ06>''' : 


770-55(5-1 2 V2 T 


• ■ • 






User3 


<Template2001> 


214-555-1111 


User3 


<Template2002> 


214-555-1112 


User3 


<Template2003> 


214-555-1113 


User3 


<Templote2004> 


214-555-1114 



34 



,fW: < ,, H6rjrve;*euser2> 



30 



RECORDING 



<"Store 101"-master> 



<"Store 102"-moster> 



-<"Store 103"-master> 



<"Store i04 M -master> 



<"Home"-user1> 



^"Mbm"-user2> 





38 




/ 




DATABASE 1/F 




SPEECH RECOGNITION 


PROCESSING 




BNSDOC1D: <WO OQ39982A2J_> 



WO 00/39982 PCT/US99/31089 

~ 3 / 3 



FIG. 3 



. .. --so 



56- 



IDENtlFY USER; PROMPT" 
F0R"VAD COMMAND 



RECEIVE iAND [RECORD; 

....„_ GET ASSOCIATED 

; TELEPHONE!- NUMBER - 




7ALREADYWES 
'STORED?> 



SAVE RECORDING 1 
FOR CONFIRMATION 



15 



>:v kINK TO -MASTER ; Z 
- REGORDlN&-(Gfl-PREVJ&£bY- 
ENROLIiED 7 RECORDtNG) I 



58 



BNSDOCID: <WO 0039982A2J_> 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
6 July 2000 (06.07.2000) 




PCT 



(10) International Publication Number 

WO 00/39982 A3 



(51) International Patent Classification 7 : H 04 M 1/27 

(21) International Application Number: PCT/US99/31089 

(22) International Filing Date: 

28 December 1999 (28.12.1999) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

09/222,746 29 December 1 998 (29. 1 2. 1 998) US 

(71) Applicant: ALCATEL USA SOURCING, L.P. [US/US]; 
1000 Coit Road, Piano. TX 75075 (US). 

(72) Inventor: SCHIER, John; 11748 DK Ranch Road, 
Austin. TX 78759 (US). 



(74) Agent: LINTEL, Alan, W.; Anderson, Levine & Lintel, 
L.L.P., Suite 1 1 1, 12160 Abrams Road, Dallas. TX 75243 
(US). 

(81) Designated States (national): AE. AL, AM, AT, AU. AZ, 
BA, BB, BG, BR, BY, CA, CH, CN, CU, CZ, DE, DK, EE, 
ES, FL GB, GD, GE, GH, GM, HR. HU, ID, IL, IN, IS, JP, 
KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT. LU, LV, MD, 
MG, MK, MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, 
SE. SG, SI, SK, SL, TJ, TM, TR, TT, UA, UG, UZ. VN, 
YU, ZA. ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE. LS, MW, SD, SL, SZ, TZ. UG, ZW), Eurasian patent 
(AM, AZ, BY, KG. KZ, MD, RU, TJ. TM), European patent 
(AT, BE, CH, CY, DE, DK, ES, FI, FRfGB, GR, IE, IT, LU, 
MC, NL, PT, SE), OAP1 patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GW, ML, MR, NE, SN, TD, TG). 

[Continued on next page] 



(54) Title: VOICE ACTIVATED DIALING WITH REDUCED STORAGE REQUIREMENTS 



< 

On 
ON 



O 



34- 



36 



USER 


TEMPLATE 


NUMBER 


Userl 


<7emptotel> 


21 4-555- 1111 


Userl 


<Temptate2> 


214-555-1112 


Userl 


<Template3> 


214-555-1113 


Userl 


<Template4> 


214-555-1114 


Userl 


<Temptote5> 


214-555-5555 








User2 


<Temptote1001> 


2M-555-1111 


User2 


<Templotel002> 


214-555-1112 


User2 


<Template1003> 


214-555-1113 


User2 


<Template1004> 


214-555-1114 


User2 


<Template100S> 


214-555-6666 


User2 


<Templote1006> 


770-555-1212 








User3 


<Tempkite200i> 


214-555-1111 


User3 


<Temp»ote2002> 


214-555-1112 


User3 


<Temp(ote2003> 


214-555-1113 


User3 


<Templote2004> 


214-555-1114 





38 








DATABASE l/r | 


SPEECH RECOGNITION 


PROCESSING 



r-»- <"Home"-user2> 



rn <"Mom"-u$er2> 



30 



RECORDING 



<"Store 10 V- master > 



<"Store 102"-master> 



<"Store 103"-moster> 



<"Store 104"-moster> 



<"Home"-user1> 




(57) Abstract: A voice activated dialing system (30) 
includes a database (36) maintaining a plurality of 
speech templates, associated telephone numbers, and 
recordings, wherein multiple speech templates can be 
linked to a single master recording. Processing circuitry 
(34) compares an utterance from a user to one or more 
of the templates from the database to find a matching 
template. When a match is found, the processing 
circuitry retrieves the telephone number associated 
with the matching template and the corresponding 
recording. The recording is replayed to the user prior 
to initiating a connection to the associated telephone 
number. Master recordings can be used for multiple 
entries which are associated with a single telephone 
number to reduce storage requirements. 



BNSDOCID: <WO__. 0039982A3J. > 



WO 00/39982 --A3 



Published: For two-letter codes and other abbreviations, refer to the n Guid- 

— With international search report ance Notes on Codes and Abbreviations" appearing at the begin- 

— Before the expiration of the time limit for amending the-„ning.ofeach regularjssue of the PCT Gazette. „JZ . 
claims and to be_ republished in the event of receipt of ■ 1 '- : - 
amendments. - ... .-.■.*■* 

(88) Date of publication of the international search report: 

18 January 2001 



2ii ; f . Z n^uf: :- sr- ' 



BNSDOCID: <WO 0O39982A3_l_> 



INTERNATIONAL SEARCH REPORT 



Interrv *al Application No 

PCT/US 99/31089 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 H04M1/27 



According to International Patent Classification (IPC) or to both national classification and IPC, 



B. FIELDS SEARCHED 



Minimum documentation searchea (classification system followed by classification symbols) 

IPC 7 H04M 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

EPO-Internal , WPI Data, PAJ 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


EP 0 762 709 A (TEXAS INSTRUMENTS INC) 


1-3,5-8, 




12 March 1997 (1997-03-12) 


10,11 




abstract 






column 2, line 29 -column 3, line 42 






column 6, 1 ine 11-18 






figures 1-5,7 




Y 




4,9 


Y 


EP 0 477 688 A (TEXAS INSTRUMENTS INC) 


4,9 




1 April 1992 (1992-04-01) 






abstract 






figure 2 




A 


EP 0 866 595 A (ERICSSON TELEF0N AB L M) 


1,6 




23 September 1998 (1998-09-23) 






abstract 






column 2, line 21 -column 4, line 13 






figures 1-5 






-/-- 





Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



* Special c a teg ones of cited documents : 

*A' document defining the general state of the art which is not 
considered to be of particular relevance 

'E' earlier document but published on or after the international 
filing date 

"L" document which may throw doubts on priority ciaim(s) or 
wmch is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O* document referring to an oral disclosure, use. exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date d aimed 



T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 



"X" 



"Y" 



document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

document of particular relevance; the claimed invention 
cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent family 



Date of the actual completion of the international search 

30 October 2000 


Date of mailing of the international search report 

06/11/2000 


Name and mailing address of the ISA 

European Patent Office. P.B. 581 B Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040. Tx. 31 651 epo nl. 
Fax; (+31-70) 340-3016 


Authorized officer 

Golzio, D 



Form PCT1SA/210 (second sneoti (July 1 992) 
BNSDOCID: <WO O039982A3J_> 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



Intern? al Application No 

PCT/US 99/31089 



C.(Continuatton) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ■ Citation of document, with indication, where appropriate., of the relevant passages 



Relevant to claim No. 



US 5 832 063 A CVYSOTSKY GEORGE J ET.AL) 
3 November 1998 : (1998-il-03) u . ■ 

figure 1 



1,6 



1 



Form PCT/!SA,210 < continuation oi second sheen {Jmy ^99Z) 
BNSDOCID. <WO 0039982A3J_> 



page 2 of 2 



INTERNATIONAL. SEARCH REPORT 

.. . -... 4 .. ' 

jrmatlon on patent family members 



Intern' . <al Application No 

PCT/US 99/31089 



Patent document 
cited in search report 


Publication- 

date 


- .. ... p a tent family ~ . 
_ member(s) 


• Publication 
date . 


EP 0762709 


A.. 


12-03-1997 _ 


CA 


2185262 A 
. 9205478 A 


13-03-1997 
05-08-1997 


EP 0477688 


A 


01-04-1992 


US 
DE 
DE 
JP 


" 5165095 ? A 
69129883 D 
69129883 T 
6085893 A 


" ; 17-11-1992 
* 03-09-1998 
25-02-1999 
25-03-1994 


EP 0866595 


A 


23-09-1998 


GB 
AU 
NO 


2323497 A 
5843998 A 
981075 A 


23- 09-1998 

24- 0Q-1 QQfl 

C *T 1770 

21-09-1998 


US 5832063 


A 


03-11-1998 


US 
US 
US 
US 


5719921 A 
6076054 A 
5895448 A 
5842165 A 


17-02-1998 
13-06-2000 
20-04-1999 
24-11-1998 



. Form PCT/1SA/210ipatenttarni»y annex) (-M.yj9S£) 
BNSDOCID: <WO 0O39982A3J_> 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
□Tfaded TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□TCOLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: ; ; 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



4& 



